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Abstract 

A tournament sequence is an increasing sequence of positive integers (ii, £21 • • •) 
such that t\ = 1 and tj+i < 2ij. A Meeussen sequence is an increasing sequence of 
positive integers (mi, m2, • • •) such that mi = 1, every nonnegative integer is the 
sum of a subset of the {mj}, and each integer rrii — 1 is the sum of a unique such 
subset. 

We show that these two properties are isomorphic. That is, we present a 
bijection between tournament and Meeussen sequences which respects the natural 
tree structure on each set. We also present an efficient technique for counting the 
number of tournament sequences of length n, and discuss the asymptotic growth 
of this number. The counting technique we introduce is suitable for application 
to other well-behaved counting problems of the same sort where a closed form or 
generating function cannot be found. 

MSC: 11B99 (Primary), 05A15, 05A16 (Secondary). 
* Partially supported by an NSF Mathematical Sciences Postdoctoral Research Fellowship 
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1 Introduction 

An infinite tournament sequence T is an infinite sequence of positive integers T = 
(ti,t 2 , ■ ■ ■) such that 

• ti — 1 and ti < t i+ i < 2ti for i = 1, 2, . . . 

For example, the first infinite tournament sequence in lexicographic order is ti = i, and 
the last is ti = 2 i ~ 1 . A finite tournament sequence T — (t l5 . . . , t n ) is a truncated infinite 
tournament sequence. 

An infinite Meeussen sequence M is an infinite sequence of positive integers M = 
(m 1? m 2 , . . .) such that 



• mi = 1 and rrii < m^+i for % — 1, 2, . . ., 

• Every nonnegative integer is the sum of a subset of the {mj, and 

• Each integer m; — 1 is the sum of a unique subset of the {m^}. 

For example, the first infinite Meeussen sequence in lexicographic order is = fi+i, 
the (i + l)st Fibonacci number, and the last is m; = 2 l ~ l . A finite Meeussen sequence 
M = (mi, . . . , m n ) is a truncated infinite Meeussen sequence. We will see that this is 
equivalent to requiring that every integer between 1 and YH=i m i ls the sum of a subset 
of the {mi}. 

We present a bijection {T} <-> {M} between these two types of sequences. The 
bijection is defined in Section |2|; it preserves the length of the sequence and respects lex- 
icographic ordering. It also acts in a surprising way on sequences with certain recurrence 
relations, as discussed in Section ||. 

Counting finite tournament (or equivalently Meeussen) sequences of length n is 
straightforward (0, sequence A008934), but if done in the obvious way takes time 
exponential in n. In Section [| we present an efficient polynomial-time algorithm for 
producing the numbers. The technique is suitable for application to other well-behaved 
counting problems of the same sort where a closed form or generating function cannot 
be found. We also discuss the asymptotic growth, proving that the log 2 of the number 
of sequences of length n is (™) — log 2 (n!) + 0(log(n) 2 ). 

Finite tournament sequences were studied in Q under the name "random knock-out 
tournaments." A sequence (t\, . . . ,t n ) represented a tournament of n rounds beginning 
with 1 + ^2 ti players; in the first round 2t n players are paired off randomly and the t n 
losers are eliminated, leaving a tournament corresponding to (t 1; . . . ,t„_i). The paper 
concerns the probabilities of certain pairings occurring in such a tournament. 

The count which first appeared in || was performed by M. Torelli, who found the 
notion of a tournament sequence useful in his investigation of sequences with certain 
properties relating to Goldbach's conjecture |10| . Tournament sequences also appear 
independently in the work of J. Shallit, where they are the possible subword complexities 



of infinite non-periodic bit strings [11 
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The observation that the beheaded Fibonacci sequence satisfies the property claimed 
above was made by Wouter Meeussen [private communication, 1999] , and we here name 
sequences with this property Meeussen sequences in his honor. Part of their definition 
is similar to that of so-called regular sequences |J, in which each term is a partial sum 
of preceding terms, which arise in the study of finite probability measures. The unique 
represent ability condition is reminiscent of 1-additive sequences (see [|7|, for example), 
but the precise form of the condition seems new. 

The authors would like to acknowledge W. Meeussen for suggesting the question, 
and N. J. A. Sloane for his Encyclopedia of Integer Sequences ||, which led us to notice 
the coincidence. Thanks also to J. Polito for useful conversations, J. Shallit for helpful 
comments on an earlier draft of this work, and D. Knuth for excellent suggestions about 
the asymptotics questions. 



2 An Isomorphism on Trees 

In this section we define a map which sends any tournament sequence, finite or infinite, 
to a Meeussen sequence of the same length. The set of all sequences of either type can 
naturally be viewed as a rooted tree: the nodes on level n of the tree correspond to 
the sequences of length n, and the parent of the sequence (s\, . . . , s n ) in the tree is the 
sequence (si, . . . , s n -i)- Our map is an isomorphism on the tree structures of the two 
types of sequences. Figure [l] shows how the beginnings of these trees look; the node for 
(si, . . . , s n ) has the label s n written on it. 

The definition of a tournament sequence (ti, . . . ,t n+ i) says that can have any 
value between t n +l and 2t n . Therefore the tree of tournament sequences has a convenient 
local description: the top node is labelled 1, and any node labelled k has k children, 
with labels k + 1, k + 2, . . . , 2k, respectively. Trees with such local descriptions have 
been called generating trees, and were introduced in ||. They have been championed 
by J. West ( [0 , |13[ ) , who has used them to study pattern-avoiding permutations, and 



by Barcucci et. al, who applied them to the enumeration of combinatorial objects 
see and references therein for more on the subject. In West's transparent notation, 
the tree of tournament sequences is 

Root: (1) 

Rule: (k) — > (k + l)(k + 2) . . . (2k) 

We will prove that the tree of Meeussen sequences is isomorphic to the tree of tour- 
nament sequences by showing that it also has the property that if a node has k children, 
then those children have k + 1, k + 2, . . . , 2k children, respectively. Since this tree 
structure clearly has no automorphisms, we conclude that there is a unique bijection 
between the two trees, and therefore a unique bijection between tournament sequences 
and Meeussen sequences which respects the ideas of extending or truncating a sequence. 

To better understand Meeussen sequences, we introduce some notation. 
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4 5 6 5 6 7 8 

5678 678910 789101112 6 7 8 9 10 789101112 8 9 10 11 12 13 14 910111213 14 1516 



5 6 7 

8 10 11 12 8 9 11 12 13 8 9 10 12 13 14 



5 6 7 8 

9 10 111213 9 10 11 12 13 14 9 10 111213 14 15 9 10 11 12 13 14 15 16 



Figure 1: The top five rows of the trees of tournament and Meeussen sequences. 

Definition 1 For A = (a±, a 2 , . . .) an integer sequence, finite or infinite, define: 

1. r(A) to be the set of integers which are representable as + . . . + ai n for some 
a h , . . . , a in in A, i x < • ■ • < i n , and 

2. ur(A) to be the set of integers so representable in exactly one way. 

For example, if A = (1, 2, 3) is our sequence, then r(A) = {0, 1, 2, 3, 4, 5, 6}, and ur(A) = 
{0,1,2,4,5,6}, where 3 is omitted because it can be represented as 3 and as 1 + 2. 
An infinite increasing integer sequence M = (mi,m2, . . .) with mi = 1 is Meeussen if 
r(M) = Z> and m 8 - 1 G ur(M) for all i. 

Given a finite Meeussen sequence M = (m 1? . . . , m n ) which we wish to extend, we 
must certainly pick m n+ i to be u + 1 for some element u G ur\M) with u > m n . We 
say such choices of u are candidates. For example, with M = (1,2,3), there are three 
candidates 4, 5, 6 for — 1, so m 4 must be one of 5, 6, 7. 

Our claim that the two trees are isomorphic then reduces to the following: 

Proposition 2 Suppose M = (m 1; . . . ,m n ) is a finite Meeussen sequence, and there are 
k candidates for m„ +1 — 1, which we designate U\ < u 2 < ■ • • < Uk- Then for each j , 
1 < j < k, the extended sequence with m n+ i = Uj + 1 is also Meeussen, and has k + j 
candidates for m n+2 — 1- 
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Proof: Let S denote the sum Y^7=i mi °f the sequence; note that S is also the largest 
candidate, Uk- Let M' = (mi, . . . ,m n+ i) be the extended sequence we get by choosing 
m n+ i — Uj + 1, and S' = S + m n+ i be its sum. 

First, we can see by induction that r(M') is the entire interval of integers [0, S']. Each 
representable sum in r(M') is an element of r(M) or is m n+ i added to such an element. 
Assume inductively that each of these types forms a single interval, [0, S] and [m n+ i, S'), 
respectively. There is no gap between the two intervals, since m n+ i — 1 = Uj G r(M), 
and the induction holds. 

Note that these two intervals have some overlap [m n+ i, S], possibly empty if we chose 
m n+ i — Uk+ 1 = S + 1. Anything in the overlap can be represented in at least two ways, 
one with m n+ i and one without. Thus the candidates for M', the elements of ur(M') 
larger than m n+ i, are in fact all larger than S. 

Next we observe that both r(M') and ur(M') are invariant under the involution 
1 1— ► S' — t, corresponding to taking the complement of a subset. The overlap [m n+1 , S] 
is similarly invariant and right in the middle, and for each candidate v of M', there will 
be a corresponding member S' — v of ur(M') smaller than m n+1 . 

Similarly, ur(M) has the same property: aside from m, . . . ,iik it contains exactly k 
more uniquely representable numbers, all of the form S — «j. We now know that the 
elements of ur(M) are, in order, S — Uk, ■ ■ ■ , S — U\, u%, . . . , u^. 

We have also concluded that the candidates for M' are exactly the numbers of the 
form u + m n+ i such that u G ur(M) and the total is strictly larger than S. Therefore 
when we chose m n+ i = Uj + 1, the candidates for M' are exactly the k numbers Ui + Uj + 1 
for % = 1, . . . , k and the j numbers (S — Uj) + Uj + 1 for i = 1, . . . ,j. Thus there are 
k + j candidates, as desired. ■ 

Note that we incidentally showed that half of our definition of Meeussen sequences is 
unnecessary. If we always choose the term m n+i to be one more than a representable sum 
from r(M), we argued above that at each finite stage, r(M) is the entire interval [O,^]. 
Thus an infinite sequence M generated this way will automatically have r(M) = Z>o, a 
condition imposed in the original definition. 

Corollary 3 There is a unique isomorphism : {T} — > {M} from the set of tournament 
sequences to the set of Meeussen sequences which preserves the rooted tree structure. 
Moreover, respects the lexicographic ordering on sequences of each type. 

Proof: By Proposition 0, the tree structure of Meeussen sequences is precisely that of 
tournament sequences: both trees start with a node with one child, and if a node has k 
children, then those children have k + 1, k + 2, . . . , 2k children, respectively. Since the 
children of a node are all distinguishable from one another, by virtue of their distinct 
numbers of children, there is a unique bijection between the two trees. 

The nodes in the trees are indexed by finite sequences of each type, so <ft is immedi- 
ately defined for finite sequences. In both trees, the distance from the root determines 
the length of the sequence, so <fi preserves it. Infinite sequences can be thought of as 
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infinite paths in the tree heading away from the root, and therefore the tree bijection 
lets us define in this well. 

It is evident that <fi respects the lexicographic ordering since, in the proof of Proposi- 
tion D, we showed that when extending a sequence M, the larger candidates correspond 
to the nodes with more children, just as in tournament sequences. ■ 

The proof of Proposition |] also gives us a way to calculate 4>(T) for any T = 
(ii,...,i n ) without constructing the full set ur(M), a task that could require expo- 
nential time. We make repeated use of the fact that the candidates for extending 
M = (mi, . . . , m n ) are easily expressed in terms of the candidates for (m 1; . . . , m„_i). 
Letting u(n,k) be the kth smallest candidate for extending (mi, . . . ,m n ), we get the 
following recurrence: 

u(n — 1, t n — t n -i) + 1, and 

J u(n - 1, k - (t n - t n _i)), if k > t n - t n -i 

| S n - u(n - 1, t n - t„_i + 1 - k), if k < t n - t n _i 
where S n — m\ + h m n . 

Beginning with mi = 1 and it(l, 1) = 1, we can quickly calculate each successive term 
of <f){T) in linear time. 



m r 



u(n, k) 



3 Properties of the Bijection 

In the introduction, we stated without proof that the beheaded Fibonacci sequence 
(1,2,3,5,8,13,...) is a Meeussen sequence, and moreover that it is the smallest one 
in lexicographic order, the image of (1, 2, 3, 4, 5,6,.. .) under the map defined above. 
This is a special case of a more general surprising property of 0, which we prove in this 
section. 

Since (1,2,3,5,8,13,...) is, coincidentally, again a tournament sequence, we can 
apply cf) to it as well. This leads to the following computational observation: 

(1, 2, 3, 5, 8, 13, 21, . . .) & (1, 2, 3, 6, 11, 20, 37, . . .) A (1, 2, 3, 7, 13, 25, 48, . . .) 

The middle sequence is the "3-bonacci" sequence beginning (1,2,3) and the last is the 
"4-bonacci" sequence beginning (1, 2, 3, 7), where we say a sequence is /c-bonacci if each 
term (after the first k) is the sum of the previous k terms. Further experimentation 
reveals that, for example, 

(1, 2, 4, 7, 12, 20, 33, 54, 88, 143, . . .) A (1, 2, 4, 7, 13, 24, 44, 81, 149, 274, . . .). 

The first sequence begins (1,2) and thereafter each term is one plus the sum of the 
previous two, while the second sequence is the 3-bonacci sequence beginning (1,2,4), 
with no additive constant in the recurrence. 

Inspired by examples of this type, we make the following observation. 
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Proposition 4 Suppose (ti, . . . , £ n +i) a finite tournament sequence, and suppose that 
for some integers k, c, it happens that both 

t n +i = t n + t n -i + ■ ■ • + tn-fc+i + c and 

= t n _i + t n _ 2 + • • ■ + C n -fe + c - 

T/ien (ti, . . . , i n+ i) ^ (mi, ... , m n+ i) ; u>/iere 

m„ + i = m n + m n -i H h m n -k+i + m n _&. 

While the statement of the proposition is designed to mimic the examples above, 
the hypothesis simplifies to t n+ i = 2t n — t n -k- Since our map respects extending or 
truncating a sequence, we are really just assuming that t n+ i = 2t n — t n ^k for some single 
pair n, k; the effect is purely local. 

To prove the proposition, it would help to have a more concrete relationship between 
terms of the tournament and Meeussen sequences associated to one another by our 
bijection. 

Lemma 5 Let T = (ii,t 2 , •••) be a (finite or infinite) tournament sequence with as- 
sociated Meeussen sequence <f>(T) = M = (mi,m 2 ,...). Write ur{M), the uniquely 
representable sums of M, as {u\ < u 2 < u 3 < ■ ■ ■}. Then: 

1. The ti 'th uniquely representable sum Ut t is rrii — 1, and 

2. The next uniquely representable sum u ti +\ is mi + m 2 + • • • + ?7ij_i + 1. 
Proof: 

1. When i — 1, we check that ti — 1, u\ — 0, and m\ = 1 directly. Now assume 
by induction that the map k i— > 1 + Uk sends ti to for some i. Then it sends 
U + 1, U + 2, . . . to the first, second, . . . number in ur\M) greater than rrii, which 
we showed was the desired value during the proof of Proposition ||. 

2. Consider the truncated sequence M, = (mi, . . . ,mj). The map t i— > (mi + ■ ■ • + 
m,) — t, as we saw in the proof of Proposition 0, is an involution on ur(Mi), and 
takes rrii — 1 to mi + ■ ■ • + mj_i + 1. Certainly nothing in between is uniquely 
representable; this is the "overlap" interval of numbers which can be represented 
either with or without using m«. This in turn means that m i+ i is strictly larger 
than mi + • • • + m^-x + 1, which is therefore in ur(M) since it is in ur(Mi). ■ 

Proof of Proposition |j: Consider M = (mi, . . . ,m n ) with sum S = mi + ■ ■ • + m n 
and ur(M) = {ui < u 2 < u% < ■ ■ ■}. By the first part of Lemma [5], we know that 
m n = 1 + u tn . Therefore ur(M) contains exactly 2t n numbers: %<•••< u tn , which 
are less than m n , and another t n which are their images under the involution t S — t. 
In particular, U2t n -i — S — Ui+i. 
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Now consider what happens when we pick t n+ i = 2t n — t n -k, as supposed by Propo- 
sition f|, and extend M accordingly: 

m n+1 = 1 + u tn+1 by Lemma § 

= 1 + u 2tn ~t n _ k 

= l + S-u tn _ k+1 

= 1 + S — (mi + • • • + m n _fe_i + 1) by Lemma | again, 

= m n + m n _i H h m n _ fc 

The new term of M is the sum of the previous k terms, as claimed. ■ 



4 The Growth of the Tree 

We would like to know the number of tournament or Meeussen sequences of length n, 
which we will designate s(n). Equivalently, we want to know the number of nodes on the 
nth level of the tree shown in Figure [l] (p. where we can count that s(n) = 1, 1, 2, 7, 41 
for n up to 5. Counting the nodes directly takes time exponential in n, and while 
we cannot present a solution in closed form, we can offer an efficient polynomial-time 
algorithm. We would also like to know the asymptotic behavior of s(n) as n gets large. 
The asymptotics reveal that s(n) grows so quickly that its generating function cannot 
be algebraic. 

Exact Counting 

Throughout this section, we consider the nodes to be labelled as in the tree of tournament 
sequences: a node with label (k) has k children, with labels {k + 1), (k + 2), . . . , (2k), 
respectively. Based on this definition, we note that the function c(n, k) counting the 
number of nodes with label (k) in row n of the tree satisfies: 

Recurrence 1 

c(l,k) = 5 k ,i 

k-l 

c(n,k) = c(n — forn>l. 

We could then find the number of nodes on row n by summing c(n, k) for all k up to 2 n_1 . 
The work involved grows exponentially in n, though, so for large n this is impractical. 
We could define a generating function in two independent variables 

g(x, y) = J2 zVc(™, k)=xy + x 2 y 2 + x 3 (y 3 + y 4 ) + ■ ■ ■ 

n,k 
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which, based on Recurrence [I], must satisfy 

/ \ ^y / \ ^y / 2\ 

g(x, y) = xy + - g(x, y) - - g(x, y ). 

l -y 1-2/ 

Rewriting this as g(x,y) = xl+y-1 + xy+y-i ^ 2 )' we can so ^ ve formally by iterated 
substitution to get 

OO n nk 

xy 



g(x, y) = J2 (V 2n ~ !) II CT 2* T ^ _ I 
„~n i,_n x y r y x 



n=0 fc=0 

This seems to offer dim prospects for a nice form for s(n), the coefficient of x n at y = 1. 

Alternatively, one could hope to work with the function d(n, k) which counts the 
number of nth-generation descendents of a node labelled (k). We can count these de- 
scendents by summing the number of (n — l)-generation descendents of the node's k 
children: 

Recurrence 2 

d(l,k) = k forallk>l, 

d(n,0) = for all n > 1, and otherwise, 

2k 

d(n,k) = ^ d(n- 
j=k+i 

The number of nodes on row n of our tree is then s(n) = d(n — 1, 1). Torelli points out 
that this recurrence can be expressed in closed form, by replacing the last line with 

d(n, k) = d(n, k — 1) — d(n - 1, k) + d(n - 1, 2k - 1) + d(n - 1, 2k) 

This alternate version embodies the notion that the tree below a node labelled (k) looks 
just like the tree below a (k — 1), but modified by pruning the branch beginning with the 
child (k) and grafting on branches beginning with (2k — 1) and (2k) instead. However, 
as n increases, either version still involves calculating an exponentially growing set of 
values. 

We offer instead the following technique for calculating the growth of the tree in 
polynomial time. Consider the family of functions p n for n — 1, 2, 3, . . . such that p n (k) 
is the number of nth-generation descendents of a node labelled (k), what we called 
d(n, k) above. Then in the spirit of Recurrence 0, we can get a recurrence relation for 
the functions p n themselves: 

Recurrence 3 

Pi(k) = k, 

2k 

Pn(k) = Yl P»-lO')- 
j=k+l 
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Purists would start the recurrence with po(k) = 1 instead. 

The key observation is that each p n is in fact a degree n polynomial in k, which we 
obtain by symbolic summation of a range of values of p n -i- Recall that the sum ^, =1 j n 
is a polynomial in k of degree n + 1. Our recurrence states that p n (k) is the sum of the 
first 2k values of p n -\ minus the sum of the first k values. Since p n -i is a polynomial in 
k by induction, so is p n . The next few polynomials after pi — k are 

3k 2 + k 7k 3 + 6k 2 + k 105k 4 + 154k 3 + 63k 2 + 6k 
P2 = ~ , Ps = ~ , Pa = , ■ • ■ 

Modern computer algebra packages can generally carry out this type of symbolic 
summation quickly, so we can use this polynomial recurrence directly to find p n , and 
evaluate p n _i(l) to find the number of nodes on level n. For example, in Maple™: 

p := proc(n) option remember; 

if (n=l) then k else sum( p(n-l), 'k'=k+1..2*k ) fi; end; 
s := n -> eval( p(n-l), k=l ); 

Then p (n) returns the polynomial p n in the variable k, and s (n) evaluates 

Pn—l 9>t k — 1, 

giving us the number of nodes on level n of the tree. 

Now that we know that p n is an nth-degree polynomial, we need not calculate the 
polynomial explicitly just to find some of its values. For example, p n is determined by 
its values at the n + 1 points k = 0,1, ... ,n, which we can find (using Recurrence 0) 
once we know p n -\ at k = 0, 1, ... , 2n. We could then fit an interpolating polynomial to 
those points to find other desired values of p n . In this case, another trick presents itself; 
we can use the linear dependence among n + 2 equally-spaced values of a polynomial p 
of degree n: 

n+1 



Va,6: ^(-1)M U )p{a + bi) = 0. 



Combining all of these tricks, we can efficiently calculate the number of nodes on row n 
of our tree as follows: 

Recurrence 4 

d(0, k) = 1 for all k, 
d(n, 0) = for all n > 0, 

d(n,k) = d(n, k - 1) - d(n - 1, k) + d(n - 1,2k - 1) + d(n - 1,2k) 

for k < n, and 



d(n, k) 



n+1 / _|_ 1 \ 

^(-l) " 1 ^™ . Jd(n,k-j) forn<k<2k + 2. 



THE ELECTRONIC JOURNAL OF COMBINATORICS 7 (2000), #R44 



11 



As before, s(n) = d(n — 1, 1) gives the number of nodes on level n of our tree. 

Now that we have a refined computational technique, we would like to evaluate its 
efficiency and justify our statement that it can calculate s(n) in time polynomial in 
n. The difficulty here is that we cannot in good conscience simply count the number 
of arithmetic operations involved in finding d(n — 1,1) using Recurrence f|, because 
the values we encounter grow quickly as n increases. Observe, for example, that s(n) 
certainly grows faster than c n for any fixed constant c, because on row n of the tree, 
every node has at least n (and at most 2 n ~ 1 ) children. 

In this situation, an analysis of algorithmic complexity must take into account the 
magnitude of the numbers involved in arithmetic operations. Bach and Shallit argue 
for what they call the naive bit complexity measure, in which we can calculate a + b and 
ab in time 0(loga + log b) and O (log a log 6), respectively. These time estimates reflect 
the speed of the naive, grade-school algorithms for adding and multiplying two numbers 
with log a and log b digits; the authors argue that these estimates are both asymptotically 
realistic and pragmatic for predicting real-world behavior of computations. 

Theorem 6 The naive bit complexity of calculating s(n) is 0(n 6 ). 

Note that this is not what one might generally call a polynomial-time computation, since 
it is polynomial in n, not in logn, the length of the input. As already mentioned, s(n) 
grows so quickly that it could not even be written down in time polynomial in logn. 

To calculate s(n), we will compute all values d(m, k) for < m < n — 1 and < k < 
2m + 2, in lexicographic order, using Recurrence |j. We first need some bound on the 
size of the numbers we encounter. 

Lemma 7 d(n, k) < 2 n ( n " 1 )/ 2 k n 

Proof: The bound is based on Recurrence |2|, d(n, k) = J2j=k+i d( n ~ 1> j)- For a fixed 
n > 1, we know d(n,k) is an increasing function on positive integers k. Therefore the 
largest of the k terms in the sum is d(n — 1, 2k), and we have d(n, k) < kd(n — 1, 2k). 
Repeating, we get: 

d(n,k) < kd(n-l,2k) < (k)(2k) d{n - 2, 4fc) <••• 

< (k)(2k)---(2 n - 1 k)d(0,2 n k) 

_ 2 n ( n -i)/ 2 fc n 

In the next section we will discuss the asymptotic growth of s(n), which we have just 
bounded by 2(™2 1 ). ■ 

Proof of Theorem^: We will perform the calculation using Recurrence ^, storing partial 
results so we never calculate any value of d twice. 

Fix some n, and suppose we know d(n — 1, k) for all < k < 2n. Calculating d(n, k) 
for all < k < 2n + 2 involves two phases. 
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1. For each k with < k < n, we compute the sum of four numbers. By Lemma |7|, 
the summands are of length 0(n 2 + nlogk), which is 0(n 2 ) since k is small. Thus 
each value of k takes time 0(n 2 ), and the whole phase takes time 0(n 3 ). 

2. For n + 1 < k < 2n + 2, we compute 2^j=i ( — l)^' -1 ^ ( n ^ 1 )d(n, k — j). As above, 
d(n,k — j) has length 0(n 2 ). The binomial coefficient ) has length 0(n), 
since numbers in the nth row of Pascal's Triangle are bounded by 2 ra ; the cost of 
computing it is negligible because we are using all of the top n rows of Pascal's 
Triangle, which we can compute by additive recurrence. 

Then we form each product in time 0(n 3 ) and take their sum in time 0(n 4 ) for 
each k. Thus the whole phase takes time 0(n 5 ). 

Thus passing from n — 1 to n takes time 0(n 5 ), and we can calculate s(n) from scratch 
in time 0(n 6 ). ■ 

In practice, Recurrence (| is easy to implement and seems to perform much better 
than the above analysis suggests for values of n we are interested in. In Maple™: 

d := proc(n,k) 
local j ; 

option remember; 
if n=0 then 1 
elif k=0 then 

elif k<=n then d(n,k-l)-d(n-l ,k)+d(n-l ,2*k-l)+d(n-l ,2*k) 
else add( (-l)~(j-l) * binomial (n+l,j) * d(n,k-j), j=l..n+l ) 

fi 
end; 

s := n -> d(n-l,l) ; 

This code seems empirically to calculate s(n) after having done the work for s(n — 1) in 
time 0(n 2 ) even when n is around 190, when s(n) has over 5000 decimal digits. This is 
the behavior one would expect if multiplication had a constant unit cost and addition 
were free. 

On a modest desktop Pentium II, this computes up to s(30) in under a second, s(85) 
in under a minute, and s(190) in about an hour; a little extra work to avoid computing 
the binomial coefficients multiple times speeds it up even more. We record s(n) for 
1 < n < 22 here. The sequence also appears as entry A008934 in Sloane's On-Line 
Encyclopedia of Integer Sequences . 

1, 1, 2, 7, 41, 397, 6377, 171886, 7892642, 627340987, 87635138366, 
21808110976027, 9780286524758582, 7981750158298108606, 
11950197013167283686587, 33046443615914736611839942, 
169758733825407174485685959261, 1627880269212042994531083889564192, 
29264239787495935863325877024506142042, 
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989901541366810465070950556260422637919176, 
63214893835996484808167529681187283166038800097, 
7643667309922877343580868981767361594845888953165967, . . . 

Asymptotic Behavior 

Now we turn to the asymptotic growth of s(n). We know from Lemma |7j that ("g 1 ) is an 
upper bound for lg s(n), where lg denotes log 2 . We will first prove that a lower bound 

for s(n) is a 2\ 2 ) /(n — 1)! for a certain constant a, and then we will show that lg s(n) is 
asymptotic to (™) —lgn\ + O(logn) 2 . 

We are grateful to Donald Knuth for suggesting the method used here. The technique 
rests on the following observation: 

Theorem 8 (Knuth) Let T be a rooted tree in which we want to know the number of 
vertices on the nth level. Select n — 1 vertices v\, . . . , t> n -i in T by choosing v± to be the 
root and picking Vi + \ uniformly at random from among the children ofvi, of which there 
are deg(fj). Then the expected value 

£(deg(t>i) deg(> 2 ) • • -deg(i; n _i)) 

is exactly the number of vertices on the nth level ofT. 

Proof: See || for a discussion of this technique in greater generality In this case, a 
proof by induction is straightforward: if this technique works for counting the nth level 
of each of k trees Ti, . . . , Tk, then it clearly also works for counting level n + 1 of the 
tree T whose root has k children, the roots of T\, . . . , T^. ■ 

We will apply Theorem [| to the tree of tournament sequences, in which, conve- 
niently, each vertex is already labelled with its degree. This means we can calculate 
s(n) by finding the expected value of the product t±t 2 ■ ■ ■ t„-i, where (ti, t 2 , . . . , £ n -i) is 
a tournament sequence selected at random by setting t% — 1 and picking uniformly 
at random from among ti + 1, ti + 2, . . . , 2^. For the remainder of this section, whenever 
we talk about a distribution for ti, it is implicitly with respect to this way of picking a 
tournament sequence. 

Lemma 9 Let s(n) be the number of tournament sequences of length n. Then 

2© 

where a = (1 - §)(1 - |)(1 -£)•••» .28878837. . . 
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Proof: Observe that there is a natural continuous analogue to the expected value 
E(t\t2 ■ • • = s{n). Consider instead the expected value E(r\r2 • • • r n -i), where 

r\ — 1 and j-j-n is a real number chosen uniformly at random from the interval (r$, 2r,]. 
Equivalently, we are taking random variables ttj (1 < i < n — 2) each with a uniform 
distribution over (1,2], and setting r i+ i = u^i. 

The expected value in the continuous case will give an underestimate of the expected 
value for the discrete version, in which the Ui are distributed the same way but we set 
t i+ i = \u{ti\ . The independence of the various U{ make the expected value easy to 
calculate: 



E(nr2r 3 ■ ■ ■ r„_i) = S((l)(«i)(«i« 2 ) • • • (uiw 2 ■ • -^-2)) 

2 n-l _ ! 2«-2 _ 1 2 2 - 1 



n — 1 n — 2 

2(5) 



Now we show that this lower bound is quite good, by finding the rate of growth of 
the error. We use lg to denote log 2 . 

Theorem 10 lgs(n) = (™) — lgn! + O(logn) 2 

Proof: In the proof of Lemma |9], we calculated E(u^ k ~ 1 ), where = r^i/r^ was 
uniformly distributed over (1,2]. In the original problem, tk and tk+i are integers, so 
the ratio tk+i/tk instead takes on a discrete set of values, each with equal probability: 

where t = tk throughout. We can rewrite this and take advantage of the ability to sum 
consecutive pth powers: 

„[{tk+iX\ (t + 1)" + (t + 2)- + ■ ■ ■ + (t + ty 
E {{—)) = 



< 



tp+ i 

= pn + 2i +0 {¥ 
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The 2 p+1 /p + 1 term is exactly the lower bound we used in Lemma |9], and we now have 
an idea of how much error this introduced: 

£(Mr ,- 1) ( 1+ ^ +0 (^)) 

The expected value now depends on t*., and to bound the error, we need some idea of 
how large we expect tk to be. 

Consider again the continuous analogue used in the proof of Lemma 0. We expect Sk 
to grow exponentially, as c k for some c, so look at the distribution of s k . Taking logs, 
we see that logs]/ k is distributed as the average of k copies of logx on (1, 2], each with 

1 / k 

mean 2 log 2 — 1. Thus as k increases without bound, the distribution of s k converges 
towards a point distribution at 4/e. As we already noted, the £& certainly grow no slower 
than the Sk- We conclude that for any constant c < 4/e and probability p < 1, there is 
a sufficiently large k$ such that Pr(t& > c k ) > p for all k > k$. 

The error we want to bound is the product of the n error terms = l + 1 j7^ + 0( n ^-) 

^ k 

for k = 1, . . . ,n. To simplify bookkeeping, we will instead think about \gs(n) and bound 
the sum of the logs of the error terms. We separate the work into two cases, depending 
on whether k is less or greater than 21ogn. 

When k < 2 log n, the denominator t k may be small, and the error terms log e& may 
be as much as O(logn). Adding up all 21ogn of these terms gives a total which is 
O(logn) 2 , the error term in the statement of our theorem. 

Finally, when k = 21ogn, we know that as long as n is sufficiently large, with high 
probability tk > c 21ogn , which is on the order of n 2 . Thus with high probability, is 
l + 0(l/n), and loge/c = 0(l/n). Since the error terms monotonically decrease, the sum 
of all the terms with k > 21ogn is bounded by ne 2 i og n = 0(1). So taking n sufficiently 
large ensures that the error in the lower bound is concentrated almost entirely in the 
first 2 log n error terms, and we are done. ■ 

Computational evidence based on the actual values of s(n) for n up to 190 indicates 
that the constant needed to make lg s(n) < \T) — lgn! + c(logn) 2 reaches a peak of 
c ~ 1.18304060 ... at n = 32 and decreases slowly thereafter. 
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